# Multimodal audio processing
Kimi Audio 7B
MIT
Kimi-Audio is an open-source foundational audio model that excels in audio understanding, generation, and dialogue.
Speech Recognition Supports Multiple Languages
K
moonshotai
55
15
Pathumma Llm Audio 1.0.0
Apache-2.0
Pathumma-llm-audio-1.0.0 is an 8-billion-parameter Thai large language model specifically designed for audio comprehension tasks, capable of processing various audio inputs including speech, general audio, and music.
Audio-to-Text
Transformers Supports Multiple Languages

P
nectec
333
7
Featured Recommended AI Models